Locality Sensitive Hashing, Jaccard Similarity, Duplicate Detection, Document Clustering

Efficient and accurate search in petabase-scale sequence repositories
nature.com·2d·
Discuss: Hacker News
🔄Burrows-Wheeler
Sorting encrypted data without decryption: a practical trick
dev.to·4h·
Discuss: DEV
Hash Functions
An enough week
blog.mitrichev.ch·23h
📈Linear programming
Nearest Neighbor CCP-Based Molecular Sequence Analysis
arxiv.org·15h
🔄Burrows-Wheeler
DupeGuru lets you quickly find and remove duplicate files from your drives
techspot.com·1d
🔄Content Deduplication
Homomorphism Problems in Graph Databases and Automatic Structures
arxiv.org·15h
🔗Graph Isomorphism
[R] DeepSeek 3.2's sparse attention mechanism
reddit.com·15h
🌀Brotli Internals
Explicit Lossless Vertex Expanders!
gilkalai.wordpress.com·9h
💎Information Crystallography
Indexing, Hashing
dev.to·1d·
Discuss: DEV
🚀Query Optimization
Mind the Gap: Quantifying Vocabulary Mismatch in E-Commerce Site Search
searchhub.io·1d·
Discuss: Hacker News
📈Search Quality
RND1: Simple, Scalable AR-to-Diffusion Conversion
radicalnumerics.ai·23h·
Discuss: Hacker News
💻Local LLMs
Contrastive Weak-to-strong Generalization
arxiv.org·15h
⧗Information Bottleneck
MetaGraph: Scalable annotated de Bruijn graphs for DNA indexing and alignment
github.com·1d·
Discuss: Hacker News
🔄Burrows-Wheeler
Fast-Convergent Proximity Graphs for Approximate Nearest Neighbor Search
arxiv.org·2d
Range Queries
Automated Copyright Infringement Detection via Semantic Fingerprinting and Dynamic Thresholding
dev.to·1d·
Discuss: DEV
👁️Perceptual Hashing
An enough week
blog.mitrichev.ch·23h
🧮Z3 Solver
Writing regex is pure joy. You can't convince me otherwise.
triangulatedexistence.mataroa.blog·17h
✅Format Verification
Parameterized Complexity of s-Club Cluster Edge Deletion
arxiv.org·1d
🧮Kolmogorov Complexity
The Library Method: Understanding @cache
dev.to·18h·
Discuss: DEV
⚡Cache Theory
Relational Transformer: Toward Zero-Shot Foundation Models for Relational Data
arxiv.org·1d
🧠Learned Indexes